Overview
Brought to you by YData
Dataset statistics
| Number of variables | 33 |
|---|---|
| Number of observations | 58895 |
| Missing cells | 126083 |
| Missing cells (%) | 6.5% |
| Duplicate rows | 3822 |
| Duplicate rows (%) | 6.5% |
| Total size in memory | 53.5 MiB |
| Average record size in memory | 952.9 B |
Variable types
| Categorical | 15 |
|---|---|
| Numeric | 15 |
| Text | 1 |
| Unsupported | 1 |
| DateTime | 1 |
| Dataset has 3822 (6.5%) duplicate rows | Duplicates |
agent is highly overall correlated with hotel | High correlation |
arrival_date_month is highly overall correlated with arrival_date_week_number | High correlation |
arrival_date_week_number is highly overall correlated with arrival_date_month | High correlation |
assigned_room_type is highly overall correlated with reserved_room_type | High correlation |
distribution_channel is highly overall correlated with market_segment | High correlation |
hotel is highly overall correlated with agent | High correlation |
is_canceled is highly overall correlated with reservation_status | High correlation |
market_segment is highly overall correlated with distribution_channel | High correlation |
reservation_status is highly overall correlated with is_canceled | High correlation |
reserved_room_type is highly overall correlated with assigned_room_type | High correlation |
children is highly imbalanced (79.9%) | Imbalance |
meal is highly imbalanced (53.5%) | Imbalance |
distribution_channel is highly imbalanced (59.7%) | Imbalance |
is_repeated_guest is highly imbalanced (80.5%) | Imbalance |
reserved_room_type is highly imbalanced (51.4%) | Imbalance |
deposit_type is highly imbalanced (70.6%) | Imbalance |
required_car_parking_spaces is highly imbalanced (80.2%) | Imbalance |
agent has 9132 (15.5%) missing values | Missing |
company has 55416 (94.1%) missing values | Missing |
customer_type has 589 (1.0%) missing values | Missing |
required_car_parking_spaces has 589 (1.0%) missing values | Missing |
reservation_status has 589 (1.0%) missing values | Missing |
kids has 58694 (99.7%) missing values | Missing |
adults is highly skewed (γ1 = 24.81617275) | Skewed |
babies is highly skewed (γ1 = 25.35395751) | Skewed |
previous_cancellations is highly skewed (γ1 = 21.14836957) | Skewed |
company is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
lead_time has 3700 (6.3%) zeros | Zeros |
stays_in_weekend_nights has 23496 (39.9%) zeros | Zeros |
stays_in_week_nights has 3603 (6.1%) zeros | Zeros |
babies has 58094 (98.6%) zeros | Zeros |
previous_cancellations has 57800 (98.1%) zeros | Zeros |
previous_bookings_not_canceled has 56863 (96.5%) zeros | Zeros |
booking_changes has 49268 (83.7%) zeros | Zeros |
days_in_waiting_list has 56503 (95.9%) zeros | Zeros |
adr has 954 (1.6%) zeros | Zeros |
total_of_special_requests has 37158 (63.1%) zeros | Zeros |
Reproduction
| Analysis started | 2025-09-19 15:03:44.057385 |
|---|---|
| Analysis finished | 2025-09-19 15:04:20.275442 |
| Duration | 36.22 seconds |
| Software version | ydata-profiling v4.16.1 |
| Download configuration | config.json |
Variables
hotel
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.8 MiB |
| Resort Hotel | |
|---|---|
| City Hotel |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 11.360489 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Resort Hotel |
|---|---|
| 2nd row | Resort Hotel |
| 3rd row | Resort Hotel |
| 4th row | Resort Hotel |
| 5th row | Resort Hotel |
Common Values
| Value | Count | Frequency (%) |
| Resort Hotel | 40063 | |
| City Hotel | 18832 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| hotel | 58895 | |
| resort | 40063 | |
| city | 18832 | 16.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 117790 | |
| e | 98958 | |
| o | 98958 | |
| 58895 | ||
| H | 58895 | |
| l | 58895 | |
| R | 40063 | 6.0% |
| s | 40063 | 6.0% |
| r | 40063 | 6.0% |
| C | 18832 | 2.8% |
| Other values (2) | 37664 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 492391 | |
| Uppercase Letter | 117790 | 17.6% |
| Space Separator | 58895 | 8.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 117790 | |
| e | 98958 | |
| o | 98958 | |
| l | 58895 | |
| s | 40063 | 8.1% |
| r | 40063 | 8.1% |
| i | 18832 | 3.8% |
| y | 18832 | 3.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 58895 | |
| R | 40063 | |
| C | 18832 | 16.0% |
Space Separator
| Value | Count | Frequency (%) |
| 58895 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 610181 | |
| Common | 58895 | 8.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 117790 | |
| e | 98958 | |
| o | 98958 | |
| H | 58895 | |
| l | 58895 | |
| R | 40063 | 6.6% |
| s | 40063 | 6.6% |
| r | 40063 | 6.6% |
| C | 18832 | 3.1% |
| i | 18832 | 3.1% |
Common
| Value | Count | Frequency (%) |
| 58895 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 669076 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 117790 | |
| e | 98958 | |
| o | 98958 | |
| 58895 | ||
| H | 58895 | |
| l | 58895 | |
| R | 40063 | 6.0% |
| s | 40063 | 6.0% |
| r | 40063 | 6.0% |
| C | 18832 | 2.8% |
| Other values (2) | 37664 | 5.6% |
is_canceled
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 34666 | |
| 1 | 24229 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 34666 | |
| 1 | 24229 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 34666 | |
| 1 | 24229 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 58895 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 34666 | |
| 1 | 24229 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 58895 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 34666 | |
| 1 | 24229 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 58895 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 34666 | |
| 1 | 24229 |
lead_time
Real number (ℝ)
Zeros 
| Distinct | 428 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.0509 |
| Minimum | 0 |
|---|---|
| Maximum | 737 |
| Zeros | 3700 |
| Zeros (%) | 6.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 460.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 17 |
| median | 69 |
| Q3 | 157 |
| 95-th percentile | 309 |
| Maximum | 737 |
| Range | 737 |
| Interquartile range (IQR) | 140 |
Descriptive statistics
| Standard deviation | 101.16242 |
|---|---|
| Coefficient of variation (CV) | 1.0111095 |
| Kurtosis | 1.0087856 |
| Mean | 100.0509 |
| Median Absolute Deviation (MAD) | 61 |
| Skewness | 1.2082573 |
| Sum | 5892498 |
| Variance | 10233.835 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3700 | 6.3% |
| 1 | 1923 | 3.3% |
| 2 | 1106 | 1.9% |
| 3 | 941 | 1.6% |
| 4 | 830 | 1.4% |
| 5 | 745 | 1.3% |
| 7 | 687 | 1.2% |
| 6 | 672 | 1.1% |
| 12 | 527 | 0.9% |
| 10 | 519 | 0.9% |
| Other values (418) | 47245 |
| Value | Count | Frequency (%) |
| 0 | 3700 | |
| 1 | 1923 | |
| 2 | 1106 | 1.9% |
| 3 | 941 | 1.6% |
| 4 | 830 | 1.4% |
| 5 | 745 | 1.3% |
| 6 | 672 | 1.1% |
| 7 | 687 | 1.2% |
| 8 | 503 | 0.9% |
| 9 | 477 | 0.8% |
| Value | Count | Frequency (%) |
| 737 | 1 | < 0.1% |
| 709 | 1 | < 0.1% |
| 605 | 9 | < 0.1% |
| 542 | 23 | |
| 532 | 1 | < 0.1% |
| 471 | 6 | < 0.1% |
| 468 | 47 | |
| 462 | 20 | |
| 461 | 32 | |
| 460 | 3 | < 0.1% |
arrival_date_year
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 588 |
| Missing (%) | 1.0% |
| Memory size | 3.5 MiB |
| 2016.0 | |
|---|---|
| 2015.0 | |
| 2017.0 | |
| 20016.0 | 614 |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 6.0105305 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2015.0 |
|---|---|
| 2nd row | 2015.0 |
| 3rd row | 2015.0 |
| 4th row | 2015.0 |
| 5th row | 2015.0 |
Common Values
| Value | Count | Frequency (%) |
| 2016.0 | 30105 | |
| 2015.0 | 14537 | |
| 2017.0 | 13051 | |
| 20016.0 | 614 | 1.0% |
| (Missing) | 588 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2016.0 | 30105 | |
| 2015.0 | 14537 | |
| 2017.0 | 13051 | |
| 20016.0 | 614 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 117228 | |
| 2 | 58307 | |
| 1 | 58307 | |
| . | 58307 | |
| 6 | 30719 | 8.8% |
| 5 | 14537 | 4.1% |
| 7 | 13051 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 292149 | |
| Other Punctuation | 58307 | 16.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 117228 | |
| 2 | 58307 | |
| 1 | 58307 | |
| 6 | 30719 | 10.5% |
| 5 | 14537 | 5.0% |
| 7 | 13051 | 4.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 58307 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 350456 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 117228 | |
| 2 | 58307 | |
| 1 | 58307 | |
| . | 58307 | |
| 6 | 30719 | 8.8% |
| 5 | 14537 | 4.1% |
| 7 | 13051 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 350456 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 117228 | |
| 2 | 58307 | |
| 1 | 58307 | |
| . | 58307 | |
| 6 | 30719 | 8.8% |
| 5 | 14537 | 4.1% |
| 7 | 13051 | 3.7% |
arrival_date_month
Categorical
High correlation 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
| August | |
|---|---|
| September | |
| July | |
| October | |
| May | |
| Other values (7) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 6.0207148 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | July |
|---|---|
| 2nd row | July |
| 3rd row | July |
| 4th row | July |
| 5th row | July |
Common Values
| Value | Count | Frequency (%) |
| August | 7715 | |
| September | 6712 | |
| July | 6177 | |
| October | 6040 | |
| May | 5283 | |
| April | 5185 | |
| June | 4725 | |
| March | 4492 | |
| February | 3830 | |
| December | 3121 | |
| Other values (2) | 5615 |
Length
| Value | Count | Frequency (%) |
| august | 7715 | |
| september | 6712 | |
| july | 6177 | |
| october | 6040 | |
| may | 5283 | |
| april | 5185 | |
| june | 4725 | |
| march | 4492 | |
| february | 3830 | |
| december | 3121 | |
| Other values (2) | 5615 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 49808 | |
| r | 38825 | 10.9% |
| u | 32920 | 9.3% |
| b | 22560 | 6.4% |
| t | 20467 | 5.8% |
| a | 19121 | 5.4% |
| y | 18048 | 5.1% |
| J | 13660 | 3.9% |
| c | 13653 | 3.9% |
| A | 12900 | 3.6% |
| Other values (16) | 112628 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 295695 | |
| Uppercase Letter | 58895 | 16.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 49808 | |
| r | 38825 | |
| u | 32920 | |
| b | 22560 | |
| t | 20467 | 6.9% |
| a | 19121 | 6.5% |
| y | 18048 | 6.1% |
| c | 13653 | 4.6% |
| m | 12690 | 4.3% |
| p | 11897 | 4.0% |
| Other values (8) | 55706 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 13660 | |
| A | 12900 | |
| M | 9775 | |
| S | 6712 | |
| O | 6040 | |
| F | 3830 | 6.5% |
| D | 3121 | 5.3% |
| N | 2857 | 4.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 354590 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 49808 | |
| r | 38825 | 10.9% |
| u | 32920 | 9.3% |
| b | 22560 | 6.4% |
| t | 20467 | 5.8% |
| a | 19121 | 5.4% |
| y | 18048 | 5.1% |
| J | 13660 | 3.9% |
| c | 13653 | 3.9% |
| A | 12900 | 3.6% |
| Other values (16) | 112628 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 354590 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 49808 | |
| r | 38825 | 10.9% |
| u | 32920 | 9.3% |
| b | 22560 | 6.4% |
| t | 20467 | 5.8% |
| a | 19121 | 5.4% |
| y | 18048 | 5.1% |
| J | 13660 | 3.9% |
| c | 13653 | 3.9% |
| A | 12900 | 3.6% |
| Other values (16) | 112628 |
arrival_date_week_number
Real number (ℝ)
High correlation 
| Distinct | 53 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.837389 |
| Minimum | 1 |
|---|---|
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 460.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 17 |
| median | 29 |
| Q3 | 38 |
| 95-th percentile | 49 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 13.346053 |
|---|---|
| Coefficient of variation (CV) | 0.47942904 |
| Kurtosis | -0.94526161 |
| Mean | 27.837389 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | -0.13277996 |
| Sum | 1639483 |
| Variance | 178.11712 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 33 | 2023 | 3.4% |
| 34 | 1703 | 2.9% |
| 41 | 1668 | 2.8% |
| 38 | 1663 | 2.8% |
| 32 | 1620 | 2.8% |
| 42 | 1617 | 2.7% |
| 37 | 1555 | 2.6% |
| 40 | 1513 | 2.6% |
| 35 | 1504 | 2.6% |
| 30 | 1496 | 2.5% |
| Other values (43) | 42533 |
| Value | Count | Frequency (%) |
| 1 | 402 | 0.7% |
| 2 | 573 | |
| 3 | 675 | |
| 4 | 687 | |
| 5 | 591 | |
| 6 | 802 | |
| 7 | 1073 | |
| 8 | 882 | |
| 9 | 950 | |
| 10 | 996 |
| Value | Count | Frequency (%) |
| 53 | 807 | |
| 52 | 655 | |
| 51 | 467 | |
| 50 | 508 | |
| 49 | 844 | |
| 48 | 710 | |
| 47 | 814 | |
| 46 | 548 | |
| 45 | 750 | |
| 44 | 1000 |
arrival_date_day_of_month
Real number (ℝ)
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.766432 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 460.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.7830365 |
|---|---|
| Coefficient of variation (CV) | 0.55707192 |
| Kurtosis | -1.1763513 |
| Mean | 15.766432 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.021266696 |
| Sum | 928564 |
| Variance | 77.14173 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 2253 | 3.8% |
| 12 | 2230 | 3.8% |
| 16 | 2211 | 3.8% |
| 18 | 2118 | 3.6% |
| 17 | 2117 | 3.6% |
| 30 | 2112 | 3.6% |
| 26 | 2096 | 3.6% |
| 9 | 2092 | 3.6% |
| 15 | 2030 | 3.4% |
| 25 | 2020 | 3.4% |
| Other values (21) | 37616 |
| Value | Count | Frequency (%) |
| 1 | 1750 | |
| 2 | 1997 | |
| 3 | 1850 | |
| 4 | 1850 | |
| 5 | 2253 | |
| 6 | 1782 | |
| 7 | 1846 | |
| 8 | 1908 | |
| 9 | 2092 | |
| 10 | 1718 |
| Value | Count | Frequency (%) |
| 31 | 1186 | |
| 30 | 2112 | |
| 29 | 1712 | |
| 28 | 1820 | |
| 27 | 1711 | |
| 26 | 2096 | |
| 25 | 2020 | |
| 24 | 1978 | |
| 23 | 1767 | |
| 22 | 1810 |
stays_in_weekend_nights
Real number (ℝ)
Zeros 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0581543 |
| Minimum | 0 |
|---|---|
| Maximum | 19 |
| Zeros | 23496 |
| Zeros (%) | 39.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 460.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 19 |
| Range | 19 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.0930323 |
|---|---|
| Coefficient of variation (CV) | 1.0329612 |
| Kurtosis | 7.4681056 |
| Mean | 1.0581543 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.4335048 |
| Sum | 62320 |
| Variance | 1.1947197 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 23496 | |
| 2 | 18437 | |
| 1 | 14032 | |
| 4 | 1640 | 2.8% |
| 3 | 1026 | 1.7% |
| 6 | 128 | 0.2% |
| 5 | 51 | 0.1% |
| 8 | 42 | 0.1% |
| 7 | 18 | < 0.1% |
| 9 | 8 | < 0.1% |
| Other values (7) | 17 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 23496 | |
| 1 | 14032 | |
| 2 | 18437 | |
| 3 | 1026 | 1.7% |
| 4 | 1640 | 2.8% |
| 5 | 51 | 0.1% |
| 6 | 128 | 0.2% |
| 7 | 18 | < 0.1% |
| 8 | 42 | 0.1% |
| 9 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 19 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 16 | 2 | < 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 2 | < 0.1% |
| 12 | 5 | < 0.1% |
| 10 | 5 | < 0.1% |
| 9 | 8 | < 0.1% |
| 8 | 42 | |
| 7 | 18 |
stays_in_week_nights
Real number (ℝ)
Zeros 
| Distinct | 33 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.8475762 |
| Minimum | 0 |
|---|---|
| Maximum | 50 |
| Zeros | 3603 |
| Zeros (%) | 6.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 460.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 50 |
| Range | 50 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.2354858 |
|---|---|
| Coefficient of variation (CV) | 0.78504864 |
| Kurtosis | 18.796619 |
| Mean | 2.8475762 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.5680997 |
| Sum | 167708 |
| Variance | 4.9973969 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 14539 | |
| 1 | 13621 | |
| 3 | 9677 | |
| 5 | 8524 | |
| 4 | 4862 | 8.3% |
| 0 | 3603 | 6.1% |
| 6 | 1220 | 2.1% |
| 10 | 936 | 1.6% |
| 7 | 887 | 1.5% |
| 8 | 532 | 0.9% |
| Other values (23) | 494 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 3603 | 6.1% |
| 1 | 13621 | |
| 2 | 14539 | |
| 3 | 9677 | |
| 4 | 4862 | 8.3% |
| 5 | 8524 | |
| 6 | 1220 | 2.1% |
| 7 | 887 | 1.5% |
| 8 | 532 | 0.9% |
| 9 | 179 | 0.3% |
| Value | Count | Frequency (%) |
| 50 | 1 | < 0.1% |
| 42 | 1 | < 0.1% |
| 40 | 2 | < 0.1% |
| 34 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 30 | 4 | |
| 26 | 1 | < 0.1% |
| 25 | 5 | |
| 24 | 1 | < 0.1% |
adults
Real number (ℝ)
Skewed 
| Distinct | 54 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.9691485 |
| Minimum | -1 |
|---|---|
| Maximum | 100 |
| Zeros | 104 |
| Zeros (%) | 0.2% |
| Negative | 99 |
| Negative (%) | 0.2% |
| Memory size | 460.2 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 100 |
| Range | 101 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.9434539 |
|---|---|
| Coefficient of variation (CV) | 1.4947851 |
| Kurtosis | 657.97785 |
| Mean | 1.9691485 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 24.816173 |
| Sum | 115973 |
| Variance | 8.6639207 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 45712 | |
| 1 | 10610 | 18.0% |
| 3 | 2225 | 3.8% |
| 0 | 104 | 0.2% |
| -1 | 99 | 0.2% |
| 4 | 34 | 0.1% |
| 66 | 6 | < 0.1% |
| 65 | 5 | < 0.1% |
| 26 | 5 | < 0.1% |
| 69 | 4 | < 0.1% |
| Other values (44) | 91 | 0.2% |
| Value | Count | Frequency (%) |
| -1 | 99 | 0.2% |
| 0 | 104 | 0.2% |
| 1 | 10610 | 18.0% |
| 2 | 45712 | |
| 3 | 2225 | 3.8% |
| 4 | 34 | 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 20 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 3 | |
| 98 | 2 | |
| 96 | 2 | |
| 95 | 3 | |
| 93 | 1 | < 0.1% |
| 92 | 2 | |
| 91 | 4 | |
| 89 | 1 | < 0.1% |
| 87 | 1 | < 0.1% |
| 86 | 2 |
children
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 3.4 MiB |
| 0.0 | |
|---|---|
| 1.0 | 2335 |
| 2.0 | 2114 |
| 3.0 | 26 |
| 10.0 | 1 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.000017 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 54415 | |
| 1.0 | 2335 | 4.0% |
| 2.0 | 2114 | 3.6% |
| 3.0 | 26 | < 0.1% |
| 10.0 | 1 | < 0.1% |
| (Missing) | 4 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 54415 | |
| 1.0 | 2335 | 4.0% |
| 2.0 | 2114 | 3.6% |
| 3.0 | 26 | < 0.1% |
| 10.0 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 113307 | |
| . | 58891 | |
| 1 | 2336 | 1.3% |
| 2 | 2114 | 1.2% |
| 3 | 26 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 117783 | |
| Other Punctuation | 58891 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 113307 | |
| 1 | 2336 | 2.0% |
| 2 | 2114 | 1.8% |
| 3 | 26 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 58891 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 176674 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 113307 | |
| . | 58891 | |
| 1 | 2336 | 1.3% |
| 2 | 2114 | 1.2% |
| 3 | 26 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 176674 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 113307 | |
| . | 58891 | |
| 1 | 2336 | 1.3% |
| 2 | 2114 | 1.2% |
| 3 | 26 | < 0.1% |
babies
Real number (ℝ)
Skewed  Zeros 
| Distinct | 47 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.13659903 |
| Minimum | -1 |
|---|---|
| Maximum | 100 |
| Zeros | 58094 |
| Zeros (%) | 98.6% |
| Negative | 90 |
| Negative (%) | 0.2% |
| Memory size | 460.2 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 100 |
| Range | 101 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.1158893 |
|---|---|
| Coefficient of variation (CV) | 22.810478 |
| Kurtosis | 665.69282 |
| Mean | 0.13659903 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 25.353958 |
| Sum | 8045 |
| Variance | 9.7087659 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 58094 | |
| 1 | 598 | 1.0% |
| -1 | 90 | 0.2% |
| 2 | 9 | < 0.1% |
| 51 | 6 | < 0.1% |
| 57 | 5 | < 0.1% |
| 77 | 5 | < 0.1% |
| 73 | 5 | < 0.1% |
| 81 | 4 | < 0.1% |
| 94 | 4 | < 0.1% |
| Other values (37) | 75 | 0.1% |
| Value | Count | Frequency (%) |
| -1 | 90 | 0.2% |
| 0 | 58094 | |
| 1 | 598 | 1.0% |
| 2 | 9 | < 0.1% |
| 10 | 1 | < 0.1% |
| 50 | 1 | < 0.1% |
| 51 | 6 | < 0.1% |
| 52 | 2 | < 0.1% |
| 53 | 2 | < 0.1% |
| 54 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 2 | |
| 99 | 2 | |
| 98 | 2 | |
| 97 | 4 | |
| 96 | 2 | |
| 95 | 1 | < 0.1% |
| 94 | 4 | |
| 93 | 3 | |
| 92 | 3 | |
| 91 | 1 | < 0.1% |
meal
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| BB | |
|---|---|
| HB | |
| SC | 1780 |
| Undefined | 1169 |
| FB | 790 |
Length
| Max length | 9 |
|---|---|
| Median length | 2 |
| Mean length | 2.1389422 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BB |
|---|---|
| 2nd row | BB |
| 3rd row | BB |
| 4th row | BB |
| 5th row | BB |
Common Values
| Value | Count | Frequency (%) |
| BB | 45060 | |
| HB | 10096 | 17.1% |
| SC | 1780 | 3.0% |
| Undefined | 1169 | 2.0% |
| FB | 790 | 1.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| bb | 45060 | |
| hb | 10096 | 17.1% |
| sc | 1780 | 3.0% |
| undefined | 1169 | 2.0% |
| fb | 790 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 101006 | |
| H | 10096 | 8.0% |
| n | 2338 | 1.9% |
| d | 2338 | 1.9% |
| e | 2338 | 1.9% |
| S | 1780 | 1.4% |
| C | 1780 | 1.4% |
| U | 1169 | 0.9% |
| f | 1169 | 0.9% |
| i | 1169 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 116621 | |
| Lowercase Letter | 9352 | 7.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 101006 | |
| H | 10096 | 8.7% |
| S | 1780 | 1.5% |
| C | 1780 | 1.5% |
| U | 1169 | 1.0% |
| F | 790 | 0.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 2338 | |
| d | 2338 | |
| e | 2338 | |
| f | 1169 | |
| i | 1169 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 125973 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 101006 | |
| H | 10096 | 8.0% |
| n | 2338 | 1.9% |
| d | 2338 | 1.9% |
| e | 2338 | 1.9% |
| S | 1780 | 1.4% |
| C | 1780 | 1.4% |
| U | 1169 | 0.9% |
| f | 1169 | 0.9% |
| i | 1169 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 125973 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 101006 | |
| H | 10096 | 8.0% |
| n | 2338 | 1.9% |
| d | 2338 | 1.9% |
| e | 2338 | 1.9% |
| S | 1780 | 1.4% |
| C | 1780 | 1.4% |
| U | 1169 | 0.9% |
| f | 1169 | 0.9% |
| i | 1169 | 0.9% |
country
Text
| Distinct | 141 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 478 |
| Missing (%) | 0.8% |
| Memory size | 3.4 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.9861855 |
| Min length | 2 |
Unique
| Unique | 28 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PRT |
|---|---|
| 2nd row | PRT |
| 3rd row | GBR |
| 4th row | GBR |
| 5th row | GBR |
| Value | Count | Frequency (%) |
| prt | 27559 | |
| gbr | 7595 | 13.0% |
| esp | 5275 | 9.0% |
| fra | 3037 | 5.2% |
| irl | 2371 | 4.1% |
| deu | 2022 | 3.5% |
| ita | 1290 | 2.2% |
| cn | 807 | 1.4% |
| nld | 748 | 1.3% |
| bel | 733 | 1.3% |
| Other values (131) | 6980 | 11.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 42796 | |
| P | 33418 | |
| T | 29542 | |
| E | 9298 | 5.3% |
| B | 9147 | 5.2% |
| G | 7985 | 4.6% |
| S | 7221 | 4.1% |
| A | 6903 | 4.0% |
| L | 4699 | 2.7% |
| U | 4268 | 2.4% |
| Other values (16) | 19167 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 174444 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 42796 | |
| P | 33418 | |
| T | 29542 | |
| E | 9298 | 5.3% |
| B | 9147 | 5.2% |
| G | 7985 | 4.6% |
| S | 7221 | 4.1% |
| A | 6903 | 4.0% |
| L | 4699 | 2.7% |
| U | 4268 | 2.4% |
| Other values (16) | 19167 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 174444 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 42796 | |
| P | 33418 | |
| T | 29542 | |
| E | 9298 | 5.3% |
| B | 9147 | 5.2% |
| G | 7985 | 4.6% |
| S | 7221 | 4.1% |
| A | 6903 | 4.0% |
| L | 4699 | 2.7% |
| U | 4268 | 2.4% |
| Other values (16) | 19167 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 174444 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 42796 | |
| P | 33418 | |
| T | 29542 | |
| E | 9298 | 5.3% |
| B | 9147 | 5.2% |
| G | 7985 | 4.6% |
| S | 7221 | 4.1% |
| A | 6903 | 4.0% |
| L | 4699 | 2.7% |
| U | 4268 | 2.4% |
| Other values (16) | 19167 |
market_segment
Categorical
High correlation 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.7 MiB |
| Online TA | |
|---|---|
| Offline TA/TO | |
| Groups | |
| Direct | |
| Corporate | |
| Other values (3) | 278 |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 8.9561423 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Direct |
|---|---|
| 2nd row | Direct |
| 3rd row | Direct |
| 4th row | Corporate |
| 5th row | Online TA |
Common Values
| Value | Count | Frequency (%) |
| Online TA | 25742 | |
| Offline TA/TO | 12455 | |
| Groups | 10399 | |
| Direct | 7400 | 12.6% |
| Corporate | 2621 | 4.5% |
| Complementary | 254 | 0.4% |
| Aviation | 22 | < 0.1% |
| Undefined | 2 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| online | 25742 | |
| ta | 25742 | |
| offline | 12455 | |
| ta/to | 12455 | |
| groups | 10399 | |
| direct | 7400 | 7.6% |
| corporate | 2621 | 2.7% |
| complementary | 254 | 0.3% |
| aviation | 22 | < 0.1% |
| undefined | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 64219 | |
| O | 50652 | |
| T | 50652 | |
| e | 48730 | |
| i | 45643 | |
| l | 38451 | 7.3% |
| A | 38219 | 7.2% |
| 38197 | 7.2% | |
| f | 24912 | 4.7% |
| r | 23295 | 4.4% |
| Other values (16) | 104502 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 316621 | |
| Uppercase Letter | 160199 | |
| Space Separator | 38197 | 7.2% |
| Other Punctuation | 12455 | 2.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 64219 | |
| e | 48730 | |
| i | 45643 | |
| l | 38451 | |
| f | 24912 | 7.9% |
| r | 23295 | 7.4% |
| o | 15917 | 5.0% |
| p | 13274 | 4.2% |
| s | 10399 | 3.3% |
| u | 10399 | 3.3% |
| Other values (7) | 21382 | 6.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 50652 | |
| T | 50652 | |
| A | 38219 | |
| G | 10399 | 6.5% |
| D | 7400 | 4.6% |
| C | 2875 | 1.8% |
| U | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 38197 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 12455 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 476820 | |
| Common | 50652 | 9.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 64219 | |
| O | 50652 | |
| T | 50652 | |
| e | 48730 | |
| i | 45643 | |
| l | 38451 | |
| A | 38219 | |
| f | 24912 | 5.2% |
| r | 23295 | 4.9% |
| o | 15917 | 3.3% |
| Other values (14) | 76130 |
Common
| Value | Count | Frequency (%) |
| 38197 | ||
| / | 12455 | 24.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 527472 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 64219 | |
| O | 50652 | |
| T | 50652 | |
| e | 48730 | |
| i | 45643 | |
| l | 38451 | 7.3% |
| A | 38219 | 7.2% |
| 38197 | 7.2% | |
| f | 24912 | 4.7% |
| r | 23295 | 4.4% |
| Other values (16) | 104502 |
distribution_channel
Categorical
High correlation  Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
| TA/TO | |
|---|---|
| Direct | |
| Corporate | 3680 |
| GDS | 11 |
| Undefined | 5 |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 5.400017 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Direct |
|---|---|
| 2nd row | Direct |
| 3rd row | Direct |
| 4th row | Corporate |
| 5th row | TA/TO |
Common Values
| Value | Count | Frequency (%) |
| TA/TO | 46358 | |
| Direct | 8841 | 15.0% |
| Corporate | 3680 | 6.2% |
| GDS | 11 | < 0.1% |
| Undefined | 5 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ta/to | 46358 | |
| direct | 8841 | 15.0% |
| corporate | 3680 | 6.2% |
| gds | 11 | < 0.1% |
| undefined | 5 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 92716 | |
| / | 46358 | |
| O | 46358 | |
| A | 46358 | |
| r | 16201 | 5.1% |
| e | 12531 | 3.9% |
| t | 12521 | 3.9% |
| D | 8852 | 2.8% |
| i | 8846 | 2.8% |
| c | 8841 | 2.8% |
| Other values (10) | 18452 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 197991 | |
| Lowercase Letter | 73685 | 23.2% |
| Other Punctuation | 46358 | 14.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 16201 | |
| e | 12531 | |
| t | 12521 | |
| i | 8846 | |
| c | 8841 | |
| o | 7360 | |
| a | 3680 | 5.0% |
| p | 3680 | 5.0% |
| n | 10 | < 0.1% |
| d | 10 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 92716 | |
| O | 46358 | |
| A | 46358 | |
| D | 8852 | 4.5% |
| C | 3680 | 1.9% |
| G | 11 | < 0.1% |
| S | 11 | < 0.1% |
| U | 5 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 46358 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 271676 | |
| Common | 46358 | 14.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 92716 | |
| O | 46358 | |
| A | 46358 | |
| r | 16201 | 6.0% |
| e | 12531 | 4.6% |
| t | 12521 | 4.6% |
| D | 8852 | 3.3% |
| i | 8846 | 3.3% |
| c | 8841 | 3.3% |
| o | 7360 | 2.7% |
| Other values (9) | 11092 | 4.1% |
Common
| Value | Count | Frequency (%) |
| / | 46358 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 318034 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 92716 | |
| / | 46358 | |
| O | 46358 | |
| A | 46358 | |
| r | 16201 | 5.1% |
| e | 12531 | 3.9% |
| t | 12521 | 3.9% |
| D | 8852 | 2.8% |
| i | 8846 | 2.8% |
| c | 8841 | 2.8% |
| Other values (10) | 18452 | 5.8% |
is_repeated_guest
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| 0 | |
|---|---|
| 1 | 1778 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 57117 | |
| 1 | 1778 | 3.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 57117 | |
| 1 | 1778 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 57117 | |
| 1 | 1778 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 58895 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 57117 | |
| 1 | 1778 | 3.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 58895 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 57117 | |
| 1 | 1778 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 58895 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 57117 | |
| 1 | 1778 | 3.0% |
previous_cancellations
Real number (ℝ)
Skewed  Zeros 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.069190933 |
| Minimum | 0 |
|---|---|
| Maximum | 26 |
| Zeros | 57800 |
| Zeros (%) | 98.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 460.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 26 |
| Range | 26 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.1021382 |
|---|---|
| Coefficient of variation (CV) | 15.928939 |
| Kurtosis | 458.83288 |
| Mean | 0.069190933 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 21.14837 |
| Sum | 4075 |
| Variance | 1.2147086 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 57800 | |
| 1 | 896 | 1.5% |
| 24 | 48 | 0.1% |
| 2 | 44 | 0.1% |
| 26 | 26 | < 0.1% |
| 25 | 25 | < 0.1% |
| 19 | 19 | < 0.1% |
| 3 | 14 | < 0.1% |
| 14 | 14 | < 0.1% |
| 4 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 57800 | |
| 1 | 896 | 1.5% |
| 2 | 44 | 0.1% |
| 3 | 14 | < 0.1% |
| 4 | 6 | < 0.1% |
| 5 | 3 | < 0.1% |
| 14 | 14 | < 0.1% |
| 19 | 19 | < 0.1% |
| 24 | 48 | 0.1% |
| 25 | 25 | < 0.1% |
| Value | Count | Frequency (%) |
| 26 | 26 | < 0.1% |
| 25 | 25 | < 0.1% |
| 24 | 48 | 0.1% |
| 19 | 19 | < 0.1% |
| 14 | 14 | < 0.1% |
| 5 | 3 | < 0.1% |
| 4 | 6 | < 0.1% |
| 3 | 14 | < 0.1% |
| 2 | 44 | 0.1% |
| 1 | 896 |
previous_bookings_not_canceled
Real number (ℝ)
Zeros 
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.099617964 |
| Minimum | 0 |
|---|---|
| Maximum | 30 |
| Zeros | 56863 |
| Zeros (%) | 96.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 460.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 30 |
| Range | 30 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.82916558 |
|---|---|
| Coefficient of variation (CV) | 8.3234544 |
| Kurtosis | 359.36435 |
| Mean | 0.099617964 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.059944 |
| Sum | 5867 |
| Variance | 0.68751556 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 56863 | |
| 1 | 973 | 1.7% |
| 2 | 388 | 0.7% |
| 3 | 204 | 0.3% |
| 4 | 127 | 0.2% |
| 5 | 91 | 0.2% |
| 6 | 56 | 0.1% |
| 7 | 37 | 0.1% |
| 8 | 33 | 0.1% |
| 9 | 24 | < 0.1% |
| Other values (21) | 99 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 56863 | |
| 1 | 973 | 1.7% |
| 2 | 388 | 0.7% |
| 3 | 204 | 0.3% |
| 4 | 127 | 0.2% |
| 5 | 91 | 0.2% |
| 6 | 56 | 0.1% |
| 7 | 37 | 0.1% |
| 8 | 33 | 0.1% |
| 9 | 24 | < 0.1% |
| Value | Count | Frequency (%) |
| 30 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 27 | 2 | |
| 26 | 1 | < 0.1% |
| 25 | 3 | |
| 24 | 2 | |
| 23 | 2 | |
| 22 | 2 | |
| 21 | 2 |
reserved_room_type
Categorical
High correlation  Imbalance 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| A | |
|---|---|
| D | |
| E | |
| G | 1649 |
| F | 1515 |
| Other values (5) | 1929 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | C |
|---|---|
| 2nd row | C |
| 3rd row | A |
| 4th row | A |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| A | 39143 | |
| D | 9516 | 16.2% |
| E | 5143 | 8.7% |
| G | 1649 | 2.8% |
| F | 1515 | 2.6% |
| C | 920 | 1.6% |
| H | 601 | 1.0% |
| B | 400 | 0.7% |
| L | 6 | < 0.1% |
| P | 2 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| a | 39143 | |
| d | 9516 | 16.2% |
| e | 5143 | 8.7% |
| g | 1649 | 2.8% |
| f | 1515 | 2.6% |
| c | 920 | 1.6% |
| h | 601 | 1.0% |
| b | 400 | 0.7% |
| l | 6 | < 0.1% |
| p | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 39143 | |
| D | 9516 | 16.2% |
| E | 5143 | 8.7% |
| G | 1649 | 2.8% |
| F | 1515 | 2.6% |
| C | 920 | 1.6% |
| H | 601 | 1.0% |
| B | 400 | 0.7% |
| L | 6 | < 0.1% |
| P | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 58895 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 39143 | |
| D | 9516 | 16.2% |
| E | 5143 | 8.7% |
| G | 1649 | 2.8% |
| F | 1515 | 2.6% |
| C | 920 | 1.6% |
| H | 601 | 1.0% |
| B | 400 | 0.7% |
| L | 6 | < 0.1% |
| P | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 58895 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 39143 | |
| D | 9516 | 16.2% |
| E | 5143 | 8.7% |
| G | 1649 | 2.8% |
| F | 1515 | 2.6% |
| C | 920 | 1.6% |
| H | 601 | 1.0% |
| B | 400 | 0.7% |
| L | 6 | < 0.1% |
| P | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 58895 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 39143 | |
| D | 9516 | 16.2% |
| E | 5143 | 8.7% |
| G | 1649 | 2.8% |
| F | 1515 | 2.6% |
| C | 920 | 1.6% |
| H | 601 | 1.0% |
| B | 400 | 0.7% |
| L | 6 | < 0.1% |
| P | 2 | < 0.1% |
assigned_room_type
Categorical
High correlation 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| A | |
|---|---|
| D | |
| E | |
| C | 2225 |
| F | 2177 |
| Other values (7) |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | C |
|---|---|
| 2nd row | C |
| 3rd row | C |
| 4th row | A |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| A | 31391 | |
| D | 13336 | |
| E | 5927 | 10.1% |
| C | 2225 | 3.8% |
| F | 2177 | 3.7% |
| G | 1917 | 3.3% |
| B | 821 | 1.4% |
| H | 712 | 1.2% |
| I | 363 | 0.6% |
| K | 23 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| a | 31391 | |
| d | 13336 | |
| e | 5927 | 10.1% |
| c | 2225 | 3.8% |
| f | 2177 | 3.7% |
| g | 1917 | 3.3% |
| b | 821 | 1.4% |
| h | 712 | 1.2% |
| i | 363 | 0.6% |
| k | 23 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 31391 | |
| D | 13336 | |
| E | 5927 | 10.1% |
| C | 2225 | 3.8% |
| F | 2177 | 3.7% |
| G | 1917 | 3.3% |
| B | 821 | 1.4% |
| H | 712 | 1.2% |
| I | 363 | 0.6% |
| K | 23 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 58895 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 31391 | |
| D | 13336 | |
| E | 5927 | 10.1% |
| C | 2225 | 3.8% |
| F | 2177 | 3.7% |
| G | 1917 | 3.3% |
| B | 821 | 1.4% |
| H | 712 | 1.2% |
| I | 363 | 0.6% |
| K | 23 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 58895 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 31391 | |
| D | 13336 | |
| E | 5927 | 10.1% |
| C | 2225 | 3.8% |
| F | 2177 | 3.7% |
| G | 1917 | 3.3% |
| B | 821 | 1.4% |
| H | 712 | 1.2% |
| I | 363 | 0.6% |
| K | 23 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 58895 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 31391 | |
| D | 13336 | |
| E | 5927 | 10.1% |
| C | 2225 | 3.8% |
| F | 2177 | 3.7% |
| G | 1917 | 3.3% |
| B | 821 | 1.4% |
| H | 712 | 1.2% |
| I | 363 | 0.6% |
| K | 23 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
booking_changes
Real number (ℝ)
Zeros 
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.24300874 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 49268 |
| Zeros (%) | 83.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 460.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.695205 |
|---|---|
| Coefficient of variation (CV) | 2.860823 |
| Kurtosis | 68.966364 |
| Mean | 0.24300874 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.7511422 |
| Sum | 14312 |
| Variance | 0.48331 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 49268 | |
| 1 | 6755 | 11.5% |
| 2 | 1921 | 3.3% |
| 3 | 553 | 0.9% |
| 4 | 220 | 0.4% |
| 5 | 81 | 0.1% |
| 6 | 42 | 0.1% |
| 7 | 21 | < 0.1% |
| 8 | 11 | < 0.1% |
| 9 | 7 | < 0.1% |
| Other values (8) | 16 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 49268 | |
| 1 | 6755 | 11.5% |
| 2 | 1921 | 3.3% |
| 3 | 553 | 0.9% |
| 4 | 220 | 0.4% |
| 5 | 81 | 0.1% |
| 6 | 42 | 0.1% |
| 7 | 21 | < 0.1% |
| 8 | 11 | < 0.1% |
| 9 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 17 | 2 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 2 | < 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 5 | |
| 12 | 1 | < 0.1% |
| 10 | 3 | < 0.1% |
| 9 | 7 | |
| 8 | 11 |
deposit_type
Categorical
Imbalance 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.8 MiB |
| No Deposit | |
|---|---|
| Non Refund | |
| No Refund | 962 |
| Refundable | 143 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.9836658 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No Deposit |
|---|---|
| 2nd row | No Deposit |
| 3rd row | No Deposit |
| 4th row | No Deposit |
| 5th row | No Deposit |
Common Values
| Value | Count | Frequency (%) |
| No Deposit | 52333 | |
| Non Refund | 5457 | 9.3% |
| No Refund | 962 | 1.6% |
| Refundable | 143 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| no | 53295 | |
| deposit | 52333 | |
| refund | 6419 | 5.5% |
| non | 5457 | 4.6% |
| refundable | 143 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 111085 | |
| e | 59038 | |
| N | 58752 | |
| 58752 | ||
| s | 52333 | |
| i | 52333 | |
| t | 52333 | |
| p | 52333 | |
| D | 52333 | |
| n | 12019 | 2.0% |
| Other values (7) | 26677 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 411589 | |
| Uppercase Letter | 117647 | 20.0% |
| Space Separator | 58752 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 111085 | |
| e | 59038 | |
| s | 52333 | |
| i | 52333 | |
| t | 52333 | |
| p | 52333 | |
| n | 12019 | 2.9% |
| f | 6562 | 1.6% |
| u | 6562 | 1.6% |
| d | 6562 | 1.6% |
| Other values (3) | 429 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 58752 | |
| D | 52333 | |
| R | 6562 | 5.6% |
Space Separator
| Value | Count | Frequency (%) |
| 58752 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 529236 | |
| Common | 58752 | 10.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 111085 | |
| e | 59038 | |
| N | 58752 | |
| s | 52333 | |
| i | 52333 | |
| t | 52333 | |
| p | 52333 | |
| D | 52333 | |
| n | 12019 | 2.3% |
| R | 6562 | 1.2% |
| Other values (6) | 20115 | 3.8% |
Common
| Value | Count | Frequency (%) |
| 58752 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 587988 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 111085 | |
| e | 59038 | |
| N | 58752 | |
| 58752 | ||
| s | 52333 | |
| i | 52333 | |
| t | 52333 | |
| p | 52333 | |
| D | 52333 | |
| n | 12019 | 2.0% |
| Other values (7) | 26677 | 4.5% |
agent
Real number (ℝ)
High correlation  Missing 
| Distinct | 249 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 9132 |
| Missing (%) | 15.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 146.98308 |
| Minimum | 1 |
|---|---|
| Maximum | 535 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 460.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 9 |
| median | 196 |
| Q3 | 240 |
| 95-th percentile | 314 |
| Maximum | 535 |
| Range | 534 |
| Interquartile range (IQR) | 231 |
Descriptive statistics
| Standard deviation | 120.11499 |
|---|---|
| Coefficient of variation (CV) | 0.81720282 |
| Kurtosis | -1.0837932 |
| Mean | 146.98308 |
| Median Absolute Deviation (MAD) | 81 |
| Skewness | 0.11667503 |
| Sum | 7314319 |
| Variance | 14427.61 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 240 | 13907 | |
| 9 | 6997 | |
| 1 | 3184 | 5.4% |
| 250 | 2870 | 4.9% |
| 241 | 1721 | 2.9% |
| 6 | 1378 | 2.3% |
| 40 | 1013 | 1.7% |
| 314 | 927 | 1.6% |
| 242 | 779 | 1.3% |
| 37 | 615 | 1.0% |
| Other values (239) | 16372 | |
| (Missing) | 9132 |
| Value | Count | Frequency (%) |
| 1 | 3184 | |
| 2 | 120 | 0.2% |
| 3 | 564 | 1.0% |
| 5 | 256 | 0.4% |
| 6 | 1378 | 2.3% |
| 7 | 481 | 0.8% |
| 8 | 558 | 0.9% |
| 9 | 6997 | |
| 10 | 39 | 0.1% |
| 11 | 225 | 0.4% |
| Value | Count | Frequency (%) |
| 535 | 3 | < 0.1% |
| 531 | 68 | |
| 527 | 35 | |
| 526 | 10 | < 0.1% |
| 510 | 2 | < 0.1% |
| 508 | 6 | < 0.1% |
| 502 | 24 | < 0.1% |
| 497 | 1 | < 0.1% |
| 495 | 50 | |
| 493 | 35 |
company
Unsupported
Missing  Rejected  Unsupported 
| Missing | 55416 |
|---|---|
| Missing (%) | 94.1% |
| Memory size | 1.8 MiB |
days_in_waiting_list
Real number (ℝ)
Zeros 
| Distinct | 99 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5259789 |
| Minimum | 0 |
|---|---|
| Maximum | 391 |
| Zeros | 56503 |
| Zeros (%) | 95.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 460.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 391 |
| Range | 391 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 21.841676 |
|---|---|
| Coefficient of variation (CV) | 6.1945001 |
| Kurtosis | 101.43631 |
| Mean | 3.5259789 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.9202327 |
| Sum | 207659 |
| Variance | 477.05883 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 56503 | |
| 39 | 186 | 0.3% |
| 58 | 164 | 0.3% |
| 31 | 102 | 0.2% |
| 69 | 89 | 0.2% |
| 87 | 80 | 0.1% |
| 63 | 80 | 0.1% |
| 111 | 70 | 0.1% |
| 101 | 65 | 0.1% |
| 77 | 63 | 0.1% |
| Other values (89) | 1492 | 2.5% |
| Value | Count | Frequency (%) |
| 0 | 56503 | |
| 1 | 7 | < 0.1% |
| 2 | 2 | < 0.1% |
| 3 | 59 | 0.1% |
| 4 | 10 | < 0.1% |
| 5 | 4 | < 0.1% |
| 6 | 4 | < 0.1% |
| 8 | 6 | < 0.1% |
| 11 | 1 | < 0.1% |
| 13 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 391 | 15 | < 0.1% |
| 379 | 15 | < 0.1% |
| 330 | 15 | < 0.1% |
| 259 | 10 | < 0.1% |
| 236 | 35 | |
| 224 | 10 | < 0.1% |
| 223 | 60 | |
| 215 | 21 | < 0.1% |
| 207 | 15 | < 0.1% |
| 187 | 45 |
customer_type
Categorical
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 589 |
| Missing (%) | 1.0% |
| Memory size | 3.8 MiB |
| Transient | |
|---|---|
| Transient-Party | |
| Contract | 2486 |
| Group | 312 |
Length
| Max length | 15 |
|---|---|
| Median length | 9 |
| Mean length | 10.281755 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Transient |
|---|---|
| 2nd row | Transient |
| 3rd row | Transient |
| 4th row | Transient |
| 5th row | Transient |
Common Values
| Value | Count | Frequency (%) |
| Transient | 42430 | |
| Transient-Party | 13078 | 22.2% |
| Contract | 2486 | 4.2% |
| Group | 312 | 0.5% |
| (Missing) | 589 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| transient | 42430 | |
| transient-party | 13078 | 22.4% |
| contract | 2486 | 4.3% |
| group | 312 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 113502 | |
| t | 73558 | |
| r | 71384 | |
| a | 71072 | |
| T | 55508 | |
| s | 55508 | |
| i | 55508 | |
| e | 55508 | |
| y | 13078 | 2.2% |
| - | 13078 | 2.2% |
| Other values (7) | 21784 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 515026 | |
| Uppercase Letter | 71384 | 11.9% |
| Dash Punctuation | 13078 | 2.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 113502 | |
| t | 73558 | |
| r | 71384 | |
| a | 71072 | |
| s | 55508 | |
| i | 55508 | |
| e | 55508 | |
| y | 13078 | 2.5% |
| o | 2798 | 0.5% |
| c | 2486 | 0.5% |
| Other values (2) | 624 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 55508 | |
| P | 13078 | 18.3% |
| C | 2486 | 3.5% |
| G | 312 | 0.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 13078 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 586410 | |
| Common | 13078 | 2.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 113502 | |
| t | 73558 | |
| r | 71384 | |
| a | 71072 | |
| T | 55508 | |
| s | 55508 | |
| i | 55508 | |
| e | 55508 | |
| y | 13078 | 2.2% |
| P | 13078 | 2.2% |
| Other values (6) | 8706 | 1.5% |
Common
| Value | Count | Frequency (%) |
| - | 13078 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 599488 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 113502 | |
| t | 73558 | |
| r | 71384 | |
| a | 71072 | |
| T | 55508 | |
| s | 55508 | |
| i | 55508 | |
| e | 55508 | |
| y | 13078 | 2.2% |
| - | 13078 | 2.2% |
| Other values (7) | 21784 | 3.6% |
adr
Real number (ℝ)
Zeros 
| Distinct | 6769 |
|---|---|
| Distinct (%) | 11.5% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 96.250426 |
| Minimum | -6.38 |
|---|---|
| Maximum | 5400 |
| Zeros | 954 |
| Zeros (%) | 1.6% |
| Negative | 1 |
| Negative (%) | < 0.1% |
| Memory size | 460.2 KiB |
Quantile statistics
| Minimum | -6.38 |
|---|---|
| 5-th percentile | 34.02 |
| Q1 | 60 |
| median | 84 |
| Q3 | 120 |
| 95-th percentile | 207.618 |
| Maximum | 5400 |
| Range | 5406.38 |
| Interquartile range (IQR) | 60 |
Descriptive statistics
| Standard deviation | 58.555599 |
|---|---|
| Coefficient of variation (CV) | 0.60836716 |
| Kurtosis | 1143.6777 |
| Mean | 96.250426 |
| Median Absolute Deviation (MAD) | 29 |
| Skewness | 13.602849 |
| Sum | 5668572.6 |
| Variance | 3428.7582 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 62 | 1786 | 3.0% |
| 75 | 1077 | 1.8% |
| 48 | 1037 | 1.8% |
| 0 | 954 | 1.6% |
| 65 | 940 | 1.6% |
| 60 | 803 | 1.4% |
| 90 | 734 | 1.2% |
| 120 | 703 | 1.2% |
| 80 | 701 | 1.2% |
| 70 | 663 | 1.1% |
| Other values (6759) | 49496 |
| Value | Count | Frequency (%) |
| -6.38 | 1 | < 0.1% |
| 0 | 954 | |
| 0.26 | 1 | < 0.1% |
| 0.5 | 1 | < 0.1% |
| 1 | 3 | < 0.1% |
| 1.56 | 2 | < 0.1% |
| 1.8 | 1 | < 0.1% |
| 2 | 8 | < 0.1% |
| 2.4 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5400 | 1 | |
| 508 | 1 | |
| 450 | 1 | |
| 437 | 1 | |
| 426.25 | 1 | |
| 402 | 1 | |
| 397.38 | 1 | |
| 392 | 2 | |
| 388 | 2 | |
| 387 | 1 |
required_car_parking_spaces
Categorical
Imbalance  Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 589 |
| Missing (%) | 1.0% |
| Memory size | 3.4 MiB |
| 0.0 | |
|---|---|
| 1.0 | |
| 2.0 | 25 |
| 8.0 | 2 |
| 3.0 | 1 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 52709 | |
| 1.0 | 5569 | 9.5% |
| 2.0 | 25 | < 0.1% |
| 8.0 | 2 | < 0.1% |
| 3.0 | 1 | < 0.1% |
| (Missing) | 589 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 52709 | |
| 1.0 | 5569 | 9.6% |
| 2.0 | 25 | < 0.1% |
| 8.0 | 2 | < 0.1% |
| 3.0 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 111015 | |
| . | 58306 | |
| 1 | 5569 | 3.2% |
| 2 | 25 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 116612 | |
| Other Punctuation | 58306 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 111015 | |
| 1 | 5569 | 4.8% |
| 2 | 25 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 58306 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 174918 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 111015 | |
| . | 58306 | |
| 1 | 5569 | 3.2% |
| 2 | 25 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 174918 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 111015 | |
| . | 58306 | |
| 1 | 5569 | 3.2% |
| 2 | 25 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
total_of_special_requests
Real number (ℝ)
Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.51222535 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 37158 |
| Zeros (%) | 63.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 460.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.7683779 |
|---|---|
| Coefficient of variation (CV) | 1.5000778 |
| Kurtosis | 1.8884772 |
| Mean | 0.51222535 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.4897608 |
| Sum | 30167 |
| Variance | 0.59040459 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 37158 | |
| 1 | 14711 | 25.0% |
| 2 | 5796 | 9.8% |
| 3 | 1066 | 1.8% |
| 4 | 149 | 0.3% |
| 5 | 14 | < 0.1% |
| (Missing) | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 37158 | |
| 1 | 14711 | 25.0% |
| 2 | 5796 | 9.8% |
| 3 | 1066 | 1.8% |
| 4 | 149 | 0.3% |
| 5 | 14 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 14 | < 0.1% |
| 4 | 149 | 0.3% |
| 3 | 1066 | 1.8% |
| 2 | 5796 | 9.8% |
| 1 | 14711 | 25.0% |
| 0 | 37158 |
reservation_status
Categorical
High correlation  Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 589 |
| Missing (%) | 1.0% |
| Memory size | 3.7 MiB |
| Check-Out | |
|---|---|
| Canceled | |
| No-Show | 797 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.574452 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Check-Out |
|---|---|
| 2nd row | Check-Out |
| 3rd row | Check-Out |
| 4th row | Check-Out |
| 5th row | Check-Out |
Common Values
| Value | Count | Frequency (%) |
| Check-Out | 34291 | |
| Canceled | 23218 | |
| No-Show | 797 | 1.4% |
| (Missing) | 589 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| check-out | 34291 | |
| canceled | 23218 | |
| no-show | 797 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 80727 | |
| C | 57509 | |
| c | 57509 | |
| h | 35088 | |
| - | 35088 | |
| u | 34291 | |
| t | 34291 | |
| O | 34291 | |
| k | 34291 | |
| a | 23218 | 4.6% |
| Other values (7) | 73639 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 371460 | |
| Uppercase Letter | 93394 | 18.7% |
| Dash Punctuation | 35088 | 7.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 80727 | |
| c | 57509 | |
| h | 35088 | |
| u | 34291 | |
| t | 34291 | |
| k | 34291 | |
| a | 23218 | 6.3% |
| n | 23218 | 6.3% |
| l | 23218 | 6.3% |
| d | 23218 | 6.3% |
| Other values (2) | 2391 | 0.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 57509 | |
| O | 34291 | |
| N | 797 | 0.9% |
| S | 797 | 0.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 35088 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 464854 | |
| Common | 35088 | 7.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 80727 | |
| C | 57509 | |
| c | 57509 | |
| h | 35088 | |
| u | 34291 | |
| t | 34291 | |
| O | 34291 | |
| k | 34291 | |
| a | 23218 | 5.0% |
| n | 23218 | 5.0% |
| Other values (6) | 50421 |
Common
| Value | Count | Frequency (%) |
| - | 35088 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 499942 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 80727 | |
| C | 57509 | |
| c | 57509 | |
| h | 35088 | |
| - | 35088 | |
| u | 34291 | |
| t | 34291 | |
| O | 34291 | |
| k | 34291 | |
| a | 23218 | 4.6% |
| Other values (7) | 73639 |
| Distinct | 921 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 460.2 KiB |
| Minimum | 2014-11-18 00:00:00 |
|---|---|
| Maximum | 2017-09-14 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
kids
Real number (ℝ)
Missing 
| Distinct | 41 |
|---|---|
| Distinct (%) | 20.4% |
| Missing | 58694 |
| Missing (%) | 99.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.41791 |
| Minimum | -1 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 105 |
| Negative (%) | 0.2% |
| Memory size | 460.2 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -1 |
| median | -1 |
| Q3 | 65 |
| 95-th percentile | 96 |
| Maximum | 100 |
| Range | 101 |
| Interquartile range (IQR) | 66 |
Descriptive statistics
| Standard deviation | 38.571032 |
|---|---|
| Coefficient of variation (CV) | 1.1206674 |
| Kurtosis | -1.6413267 |
| Mean | 34.41791 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.31316079 |
| Sum | 6918 |
| Variance | 1487.7245 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1 | 105 | 0.2% |
| 62 | 6 | < 0.1% |
| 60 | 5 | < 0.1% |
| 63 | 5 | < 0.1% |
| 58 | 5 | < 0.1% |
| 57 | 5 | < 0.1% |
| 100 | 3 | < 0.1% |
| 80 | 3 | < 0.1% |
| 98 | 3 | < 0.1% |
| 59 | 3 | < 0.1% |
| Other values (31) | 58 | 0.1% |
| (Missing) | 58694 |
| Value | Count | Frequency (%) |
| -1 | 105 | |
| 52 | 2 | < 0.1% |
| 53 | 3 | < 0.1% |
| 54 | 1 | < 0.1% |
| 55 | 1 | < 0.1% |
| 56 | 2 | < 0.1% |
| 57 | 5 | < 0.1% |
| 58 | 5 | < 0.1% |
| 59 | 3 | < 0.1% |
| 60 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 3 | |
| 98 | 3 | |
| 97 | 3 | |
| 96 | 3 | |
| 95 | 3 | |
| 93 | 1 | < 0.1% |
| 92 | 1 | < 0.1% |
| 89 | 2 | |
| 88 | 1 | < 0.1% |
| 87 | 1 | < 0.1% |
Interactions
Correlations
| adr | adults | agent | arrival_date_day_of_month | arrival_date_month | arrival_date_week_number | arrival_date_year | assigned_room_type | babies | booking_changes | children | customer_type | days_in_waiting_list | deposit_type | distribution_channel | hotel | is_canceled | is_repeated_guest | kids | lead_time | market_segment | meal | previous_bookings_not_canceled | previous_cancellations | required_car_parking_spaces | reservation_status | reserved_room_type | stays_in_week_nights | stays_in_weekend_nights | total_of_special_requests | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| adr | 1.000 | 0.311 | 0.003 | 0.032 | 0.004 | 0.110 | 0.039 | 0.000 | 0.037 | -0.008 | 0.000 | 0.000 | 0.026 | 0.011 | 0.000 | 0.000 | 0.000 | 0.000 | 0.035 | 0.099 | 0.000 | 0.000 | -0.146 | -0.082 | 0.000 | 0.000 | 0.000 | 0.150 | 0.099 | 0.131 |
| adults | 0.311 | 1.000 | 0.006 | 0.005 | 0.005 | 0.043 | 0.013 | 0.000 | 0.030 | -0.048 | 0.000 | 0.104 | -0.032 | 0.000 | 0.002 | 0.008 | 0.013 | 0.000 | -0.028 | 0.178 | 0.007 | 0.000 | -0.215 | -0.024 | 0.000 | 0.006 | 0.000 | 0.151 | 0.136 | 0.140 |
| agent | 0.003 | 0.006 | 1.000 | 0.007 | 0.121 | -0.092 | 0.230 | 0.142 | 0.035 | 0.141 | 0.080 | 0.199 | -0.135 | 0.177 | 0.195 | 0.817 | 0.271 | 0.073 | -0.233 | -0.099 | 0.300 | 0.213 | 0.061 | 0.028 | 0.136 | 0.194 | 0.152 | 0.155 | 0.148 | 0.239 |
| arrival_date_day_of_month | 0.032 | 0.005 | 0.007 | 1.000 | 0.067 | 0.053 | 0.038 | 0.015 | 0.000 | 0.006 | 0.012 | 0.034 | 0.031 | 0.061 | 0.038 | 0.050 | 0.019 | 0.015 | -0.035 | -0.006 | 0.041 | 0.050 | 0.006 | -0.024 | 0.012 | 0.021 | 0.016 | -0.018 | -0.016 | 0.005 |
| arrival_date_month | 0.004 | 0.005 | 0.121 | 0.067 | 1.000 | 0.796 | 0.346 | 0.044 | 0.000 | 0.014 | 0.079 | 0.124 | 0.086 | 0.125 | 0.096 | 0.234 | 0.178 | 0.123 | 0.000 | 0.135 | 0.117 | 0.112 | 0.022 | 0.045 | 0.031 | 0.140 | 0.058 | 0.050 | 0.066 | 0.073 |
| arrival_date_week_number | 0.110 | 0.043 | -0.092 | 0.053 | 0.796 | 1.000 | 0.343 | 0.048 | 0.008 | -0.002 | 0.071 | 0.117 | 0.028 | 0.119 | 0.094 | 0.250 | 0.185 | 0.127 | -0.091 | 0.110 | 0.105 | 0.100 | -0.069 | 0.050 | 0.030 | 0.143 | 0.054 | 0.024 | 0.020 | 0.015 |
| arrival_date_year | 0.039 | 0.013 | 0.230 | 0.038 | 0.346 | 0.343 | 1.000 | 0.097 | 0.000 | 0.024 | 0.046 | 0.127 | 0.068 | 0.095 | 0.039 | 0.369 | 0.212 | 0.101 | 0.000 | 0.112 | 0.108 | 0.102 | 0.037 | 0.047 | 0.033 | 0.150 | 0.115 | 0.036 | 0.059 | 0.099 |
| assigned_room_type | 0.000 | 0.000 | 0.142 | 0.015 | 0.044 | 0.048 | 0.097 | 1.000 | 0.000 | 0.079 | 0.327 | 0.099 | 0.042 | 0.169 | 0.106 | 0.382 | 0.263 | 0.083 | 0.000 | 0.061 | 0.132 | 0.098 | 0.011 | 0.018 | 0.096 | 0.188 | 0.777 | 0.054 | 0.080 | 0.085 |
| babies | 0.037 | 0.030 | 0.035 | 0.000 | 0.000 | 0.008 | 0.000 | 0.000 | 1.000 | 0.106 | 0.000 | 0.000 | -0.019 | 0.006 | 0.000 | 0.000 | 0.000 | 0.000 | 0.048 | -0.006 | 0.000 | 0.000 | -0.017 | -0.010 | 0.021 | 0.000 | 0.000 | 0.030 | 0.027 | 0.100 |
| booking_changes | -0.008 | -0.048 | 0.141 | 0.006 | 0.014 | -0.002 | 0.024 | 0.079 | 0.106 | 1.000 | 0.022 | 0.035 | -0.023 | 0.022 | 0.030 | 0.040 | 0.057 | 0.000 | -0.025 | 0.029 | 0.023 | 0.018 | 0.023 | -0.030 | 0.018 | 0.040 | 0.014 | 0.097 | 0.060 | 0.056 |
| children | 0.000 | 0.000 | 0.080 | 0.012 | 0.079 | 0.071 | 0.046 | 0.327 | 0.000 | 0.022 | 1.000 | 0.061 | 0.022 | 0.057 | 0.043 | 0.060 | 0.044 | 0.032 | 0.000 | 0.025 | 0.106 | 0.030 | 0.000 | 0.000 | 0.031 | 0.040 | 0.386 | 0.007 | 0.034 | 0.049 |
| customer_type | 0.000 | 0.104 | 0.199 | 0.034 | 0.124 | 0.117 | 0.127 | 0.099 | 0.000 | 0.035 | 0.061 | 1.000 | 0.105 | 0.114 | 0.095 | 0.109 | 0.196 | 0.151 | 0.189 | 0.076 | 0.311 | 0.126 | 0.035 | 0.006 | 0.058 | 0.139 | 0.123 | 0.108 | 0.130 | 0.114 |
| days_in_waiting_list | 0.026 | -0.032 | -0.135 | 0.031 | 0.086 | 0.028 | 0.068 | 0.042 | -0.019 | -0.023 | 0.022 | 0.105 | 1.000 | 0.113 | 0.035 | 0.222 | 0.061 | 0.027 | 0.050 | 0.183 | 0.092 | 0.062 | -0.036 | -0.028 | 0.041 | 0.047 | 0.037 | -0.003 | -0.099 | -0.137 |
| deposit_type | 0.011 | 0.000 | 0.177 | 0.061 | 0.125 | 0.119 | 0.095 | 0.169 | 0.006 | 0.022 | 0.057 | 0.114 | 0.113 | 1.000 | 0.075 | 0.310 | 0.411 | 0.061 | 0.000 | 0.227 | 0.275 | 0.068 | 0.013 | 0.063 | 0.068 | 0.298 | 0.127 | 0.045 | 0.065 | 0.154 |
| distribution_channel | 0.000 | 0.002 | 0.195 | 0.038 | 0.096 | 0.094 | 0.039 | 0.106 | 0.000 | 0.030 | 0.043 | 0.095 | 0.035 | 0.075 | 1.000 | 0.234 | 0.203 | 0.214 | 0.034 | 0.116 | 0.668 | 0.065 | 0.111 | 0.036 | 0.080 | 0.148 | 0.120 | 0.014 | 0.071 | 0.078 |
| hotel | 0.000 | 0.008 | 0.817 | 0.050 | 0.234 | 0.250 | 0.369 | 0.382 | 0.000 | 0.040 | 0.060 | 0.109 | 0.222 | 0.310 | 0.234 | 1.000 | 0.396 | 0.121 | 0.307 | 0.154 | 0.224 | 0.284 | 0.060 | 0.034 | 0.203 | 0.397 | 0.315 | 0.142 | 0.174 | 0.221 |
| is_canceled | 0.000 | 0.013 | 0.271 | 0.019 | 0.178 | 0.185 | 0.212 | 0.263 | 0.000 | 0.057 | 0.044 | 0.196 | 0.061 | 0.411 | 0.203 | 0.396 | 1.000 | 0.125 | 0.112 | 0.242 | 0.232 | 0.140 | 0.068 | 0.058 | 0.272 | 1.000 | 0.090 | 0.051 | 0.039 | 0.216 |
| is_repeated_guest | 0.000 | 0.000 | 0.073 | 0.015 | 0.123 | 0.127 | 0.101 | 0.083 | 0.000 | 0.000 | 0.032 | 0.151 | 0.027 | 0.061 | 0.214 | 0.121 | 0.125 | 1.000 | 0.000 | 0.127 | 0.269 | 0.057 | 0.314 | 0.067 | 0.080 | 0.125 | 0.036 | 0.022 | 0.085 | 0.065 |
| kids | 0.035 | -0.028 | -0.233 | -0.035 | 0.000 | -0.091 | 0.000 | 0.000 | 0.048 | -0.025 | 0.000 | 0.189 | 0.050 | 0.000 | 0.034 | 0.307 | 0.112 | 0.000 | 1.000 | 0.086 | 0.142 | 0.141 | -0.104 | -0.011 | 0.000 | 0.075 | 0.000 | -0.024 | -0.029 | -0.164 |
| lead_time | 0.099 | 0.178 | -0.099 | -0.006 | 0.135 | 0.110 | 0.112 | 0.061 | -0.006 | 0.029 | 0.025 | 0.076 | 0.183 | 0.227 | 0.116 | 0.154 | 0.242 | 0.127 | 0.086 | 1.000 | 0.176 | 0.091 | -0.194 | 0.084 | 0.071 | 0.181 | 0.043 | 0.399 | 0.252 | -0.071 |
| market_segment | 0.000 | 0.007 | 0.300 | 0.041 | 0.117 | 0.105 | 0.108 | 0.132 | 0.000 | 0.023 | 0.106 | 0.311 | 0.092 | 0.275 | 0.668 | 0.224 | 0.232 | 0.269 | 0.142 | 0.176 | 1.000 | 0.179 | 0.096 | 0.041 | 0.106 | 0.175 | 0.148 | 0.049 | 0.083 | 0.203 |
| meal | 0.000 | 0.000 | 0.213 | 0.050 | 0.112 | 0.100 | 0.102 | 0.098 | 0.000 | 0.018 | 0.030 | 0.126 | 0.062 | 0.068 | 0.065 | 0.284 | 0.140 | 0.057 | 0.141 | 0.091 | 0.179 | 1.000 | 0.017 | 0.088 | 0.030 | 0.105 | 0.076 | 0.048 | 0.077 | 0.053 |
| previous_bookings_not_canceled | -0.146 | -0.215 | 0.061 | 0.006 | 0.022 | -0.069 | 0.037 | 0.011 | -0.017 | 0.023 | 0.000 | 0.035 | -0.036 | 0.013 | 0.111 | 0.060 | 0.068 | 0.314 | -0.104 | -0.194 | 0.096 | 0.017 | 1.000 | 0.122 | 0.024 | 0.047 | 0.008 | -0.122 | -0.100 | 0.023 |
| previous_cancellations | -0.082 | -0.024 | 0.028 | -0.024 | 0.045 | 0.050 | 0.047 | 0.018 | -0.010 | -0.030 | 0.000 | 0.006 | -0.028 | 0.063 | 0.036 | 0.034 | 0.058 | 0.067 | -0.011 | 0.084 | 0.041 | 0.088 | 0.122 | 1.000 | 0.001 | 0.042 | 0.012 | 0.006 | 0.004 | -0.035 |
| required_car_parking_spaces | 0.000 | 0.000 | 0.136 | 0.012 | 0.031 | 0.030 | 0.033 | 0.096 | 0.021 | 0.018 | 0.031 | 0.058 | 0.041 | 0.068 | 0.080 | 0.203 | 0.272 | 0.080 | 0.000 | 0.071 | 0.106 | 0.030 | 0.024 | 0.001 | 1.000 | 0.193 | 0.079 | 0.026 | 0.026 | 0.060 |
| reservation_status | 0.000 | 0.006 | 0.194 | 0.021 | 0.140 | 0.143 | 0.150 | 0.188 | 0.000 | 0.040 | 0.040 | 0.139 | 0.047 | 0.298 | 0.148 | 0.397 | 1.000 | 0.125 | 0.075 | 0.181 | 0.175 | 0.105 | 0.047 | 0.042 | 0.193 | 1.000 | 0.065 | 0.042 | 0.031 | 0.155 |
| reserved_room_type | 0.000 | 0.000 | 0.152 | 0.016 | 0.058 | 0.054 | 0.115 | 0.777 | 0.000 | 0.014 | 0.386 | 0.123 | 0.037 | 0.127 | 0.120 | 0.315 | 0.090 | 0.036 | 0.000 | 0.043 | 0.148 | 0.076 | 0.008 | 0.012 | 0.079 | 0.065 | 1.000 | 0.047 | 0.064 | 0.089 |
| stays_in_week_nights | 0.150 | 0.151 | 0.155 | -0.018 | 0.050 | 0.024 | 0.036 | 0.054 | 0.030 | 0.097 | 0.007 | 0.108 | -0.003 | 0.045 | 0.014 | 0.142 | 0.051 | 0.022 | -0.024 | 0.399 | 0.049 | 0.048 | -0.122 | 0.006 | 0.026 | 0.042 | 0.047 | 1.000 | 0.429 | 0.103 |
| stays_in_weekend_nights | 0.099 | 0.136 | 0.148 | -0.016 | 0.066 | 0.020 | 0.059 | 0.080 | 0.027 | 0.060 | 0.034 | 0.130 | -0.099 | 0.065 | 0.071 | 0.174 | 0.039 | 0.085 | -0.029 | 0.252 | 0.083 | 0.077 | -0.100 | 0.004 | 0.026 | 0.031 | 0.064 | 0.429 | 1.000 | 0.105 |
| total_of_special_requests | 0.131 | 0.140 | 0.239 | 0.005 | 0.073 | 0.015 | 0.099 | 0.085 | 0.100 | 0.056 | 0.049 | 0.114 | -0.137 | 0.154 | 0.078 | 0.221 | 0.216 | 0.065 | -0.164 | -0.071 | 0.203 | 0.053 | 0.023 | -0.035 | 0.060 | 0.155 | 0.089 | 0.103 | 0.105 | 1.000 |
Missing values
Sample
| hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | reservation_status_date | kids | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Resort Hotel | 0 | 342 | 2015.0 | July | 27 | 1 | 0 | 0 | 2 | 0.0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | C | C | 3 | No Deposit | NaN | NaN | 0.0 | Transient | 0.0 | 0.0 | 0.0 | Check-Out | 2015-07-01 | NaN |
| 1 | Resort Hotel | 0 | 737 | 2015.0 | July | 27 | 1 | 0 | 0 | 2 | 0.0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | C | C | 4 | No Deposit | NaN | NaN | 0.0 | Transient | 0.0 | 0.0 | 0.0 | Check-Out | 2015-07-01 | NaN |
| 2 | Resort Hotel | 0 | 7 | 2015.0 | July | 27 | 1 | 0 | 1 | 1 | 0.0 | 0 | BB | GBR | Direct | Direct | 0 | 0 | 0 | A | C | 0 | No Deposit | NaN | NaN | 0.0 | Transient | 75.0 | 0.0 | 0.0 | Check-Out | 2015-07-02 | NaN |
| 3 | Resort Hotel | 0 | 13 | 2015.0 | July | 27 | 1 | 0 | 1 | 1 | 0.0 | 0 | BB | GBR | Corporate | Corporate | 0 | 0 | 0 | A | A | 0 | No Deposit | 304.0 | NaN | 0.0 | Transient | 75.0 | 0.0 | 0.0 | Check-Out | 2015-07-02 | NaN |
| 4 | Resort Hotel | 0 | 14 | 2015.0 | July | 27 | 1 | 0 | 2 | 2 | 0.0 | 0 | BB | GBR | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 240.0 | NaN | 0.0 | Transient | 98.0 | 0.0 | 1.0 | Check-Out | 2015-07-03 | NaN |
| 5 | Resort Hotel | 0 | 14 | 2015.0 | July | 27 | 1 | 0 | 2 | 2 | 0.0 | 0 | BB | GBR | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 240.0 | NaN | 0.0 | Transient | 98.0 | 0.0 | 1.0 | Check-Out | 2015-07-03 | NaN |
| 6 | Resort Hotel | 0 | 0 | 2015.0 | July | 27 | 1 | 0 | 2 | 2 | 0.0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | C | C | 0 | No Deposit | NaN | NaN | 0.0 | Transient | 107.0 | 0.0 | 0.0 | Check-Out | 2015-07-03 | NaN |
| 7 | Resort Hotel | 0 | 9 | 2015.0 | July | 27 | 1 | 0 | 2 | 2 | 0.0 | 0 | FB | PRT | Direct | Direct | 0 | 0 | 0 | C | C | 0 | No Deposit | 303.0 | NaN | 0.0 | Transient | 103.0 | 0.0 | 1.0 | Check-Out | 2015-07-03 | NaN |
| 8 | Resort Hotel | 1 | 85 | 2015.0 | July | 27 | 1 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 240.0 | NaN | 0.0 | Transient | 82.0 | 0.0 | 1.0 | Canceled | 2015-05-06 | NaN |
| 9 | Resort Hotel | 1 | 75 | 2015.0 | July | 27 | 1 | 0 | 3 | 2 | 0.0 | 0 | HB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | D | D | 0 | No Deposit | 15.0 | NaN | 0.0 | Transient | 105.5 | 0.0 | 0.0 | Canceled | 2015-04-22 | NaN |
| hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | reservation_status_date | kids | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 58885 | City Hotel | 1 | 605 | 2016.0 | October | 43 | 17 | 1 | 2 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 1.0 | NaN | 0.0 | Transient | 60.00 | 0.0 | 0.0 | Canceled | 2016-09-20 | NaN |
| 58886 | City Hotel | 1 | 605 | 2016.0 | October | 43 | 17 | 1 | 2 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 1.0 | NaN | 0.0 | Transient | 60.00 | 0.0 | 0.0 | Canceled | 2016-09-20 | NaN |
| 58887 | City Hotel | 1 | 605 | 2016.0 | October | 43 | 17 | 1 | 2 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 1.0 | NaN | 0.0 | Transient | 60.00 | 0.0 | 0.0 | Canceled | 2016-09-20 | NaN |
| 58888 | City Hotel | 1 | 605 | 2016.0 | October | 43 | 17 | 1 | 2 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 1.0 | NaN | 0.0 | Transient | 60.00 | 0.0 | 0.0 | Canceled | 2016-09-20 | NaN |
| 58889 | City Hotel | 1 | 605 | 2016.0 | October | 43 | 17 | 1 | 2 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | No Refund | 1.0 | NU | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 58890 | Resort Hotel | 0 | 3 | 2016.0 | April | 16 | 11 | 1 | 0 | 1 | 0.0 | 0 | BB | PRT | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 240.0 | NaN | 0.0 | Transient-Party | 56.00 | 0.0 | 1.0 | Check-Out | 2016-04-12 | NaN |
| 58891 | Resort Hotel | 1 | 158 | 2016.0 | May | 20 | 8 | 2 | 2 | 2 | 0.0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | F | F | 2 | No Deposit | 250.0 | NaN | 0.0 | Transient | 83.05 | 0.0 | 1.0 | Canceled | 2016-01-21 | NaN |
| 58892 | City Hotel | 1 | 18 | 2016.0 | August | 32 | 6 | 2 | 2 | 2 | 0.0 | 0 | BB | ESP | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | NaN | 0.0 | Transient | 151.00 | 0.0 | 2.0 | Canceled | 2016-07-28 | NaN |
| 58893 | Resort Hotel | 1 | 383 | 2016.0 | October | 41 | 6 | 1 | 3 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 315.0 | NaN | 0.0 | Transient-Party | 48.00 | 0.0 | 0.0 | Canceled | 2016-03-04 | NaN |
| 58894 | City Hotel | 1 | 185 | 2016.0 | July | 28 | 5 | 0 | 4 | 2 | 0.0 | 0 | BB | DEU | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | NaN | 0.0 | Transient | 90.95 | 0.0 | 1.0 | Canceled | 2016-05-31 | NaN |
Duplicate rows
Most frequently occurring
| hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | reservation_status_date | kids | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1309 | City Hotel | 1 | 188 | 2016.0 | June | 25 | 15 | 0 | 2 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 119.0 | 39.0 | Transient | 130.0 | 0.0 | 0.0 | Canceled | 2016-01-18 | NaN | 91 |
| 1219 | City Hotel | 1 | 158 | 2016.0 | May | 22 | 24 | 0 | 2 | 1 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 37.0 | 31.0 | Transient | 130.0 | 0.0 | 0.0 | Canceled | 2016-01-18 | NaN | 77 |
| 733 | City Hotel | 1 | 37 | 2016.0 | October | 42 | 13 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 56.0 | 0.0 | Transient-Party | 105.0 | 0.0 | 0.0 | Canceled | 2016-09-06 | NaN | 75 |
| 740 | City Hotel | 1 | 39 | 2015.0 | August | 33 | 14 | 0 | 2 | 2 | 0.0 | 0 | HB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 6.0 | 0.0 | Transient-Party | 101.5 | 0.0 | 0.0 | Canceled | 2015-07-06 | NaN | 68 |
| 886 | City Hotel | 1 | 71 | 2016.0 | June | 25 | 14 | 0 | 3 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 236.0 | 0.0 | Transient | 120.0 | 0.0 | 0.0 | Canceled | 2016-04-27 | NaN | 68 |
| 550 | City Hotel | 1 | 1 | 2016.0 | February | 10 | 28 | 2 | 1 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 134.0 | 0.0 | Transient-Party | 60.0 | 0.0 | 0.0 | Canceled | 2016-02-27 | NaN | 63 |
| 960 | City Hotel | 1 | 87 | 2015.0 | September | 39 | 25 | 2 | 3 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 1.0 | 0.0 | Transient | 170.0 | 0.0 | 0.0 | Canceled | 2015-09-09 | NaN | 59 |
| 812 | City Hotel | 1 | 56 | 2016.0 | June | 24 | 8 | 0 | 1 | 2 | 0.0 | 0 | BB | PRT | Offline TA/TO | Corporate | 0 | 0 | 0 | A | A | 0 | No Deposit | 191.0 | 0.0 | Transient-Party | 120.0 | 0.0 | 0.0 | Canceled | 2016-06-02 | NaN | 55 |
| 905 | City Hotel | 1 | 74 | 2015.0 | September | 38 | 18 | 0 | 2 | 2 | 0.0 | 0 | HB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 6.0 | 0.0 | Transient-Party | 101.5 | 0.0 | 0.0 | Canceled | 2015-07-06 | NaN | 54 |
| 1055 | City Hotel | 1 | 105 | 2016.0 | April | 15 | 6 | 0 | 1 | 2 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 12.0 | 0.0 | Transient | 75.0 | 0.0 | 0.0 | Canceled | 2016-01-18 | NaN | 52 |